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Abstract. Let © be a smooth compact oriented manifold without boundary, 
imbedded in a eucHdean space E*, and let 7 be a smooth map of into 
a Riemannian manifold A. An unknown state S £ is observed via X = 
6 + where e > is a small parameter and § is a white Gaussian noise. 
For a given smooth prior A on and smooth estimators g{X) of the map 7 
we derive a second-order asymptotic expansion for the related Bayesian risk. 
The calculation involves the geometry of the underlying spaces and A, in 
particular, the integration-by-parts formula. Using this result, a second-order 
minimax estimator of 7 is found based on the modern theory of harmonic maps 
and hypo-elliptic differential operators. 

1. Introduction 

In many estimation problems, one has a state which lies on a manifold but one 
observes this state plus some error in a euclidean space. It is desirable to utilise 
the underlying geometry to construct an estimator of the state. The present paper 
uses a Bayesian approach to construct asymptotically minimax estimators along 
with the least favourable Bayesian priors. 

The use of differential geometry in optimal statistical estimation has a long 
history, as documented in a recent article "Information geometry" on Wikipedia, 
for example. Early applications of differential geometry to the derivation of second- 
order asymptotic properties of the maximum likelihood estimates are summarized 
in [T]. However, a rigorous approach to second-order optimality requires a decision- 
theoretical framework. This approach was developed in [6l [2j [7] and a number of 
subsequent publications. 

In some cases, one is interested in the second-order optimal estimation of a given 
function of parameters. For an early application of this approach see [5]. As a gen- 
eral rule, such problems require more sophisticated differential-geometric techniques 
such as the theory of harmonic maps and hypoelliptic differential operators [31 13] . 

Consider the following situation: E is a real s-dimensional vector space with 
inner product a and Q (resp. A) is a smooth manifold with riemannian metric g 
(resp. h). Assume that the smooth riemannian manifold (6,g) is isometrically 
embedded in a euclidean space (E, a) via the inclusion map i, and e A is a 
smooth map. Smooth means infinitely differentiable. These data are summarized 
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by the diagram 

(E,a)^ 

(e,g) ^- — ^(A,h), 

Suppose that X € E is a gaussian random variable with conditional mean 6 E & 
and covariance operatoiQ e^c, i.e. 

A basic statistical problem is to determine an estimator of "7(A")," by which we 
mean an optimal extension of 7 off 0, in the minimax sense. To make this precise, 
let 5 : E ^ A be an estimator, and let dist be the riemannian distance function of 
(A, h). Define a loss function by 

(1) Rei9,9)^ f dist(g(x),7(e))2^,(a;-i(0))dx, 

where | • | is the norm on E induced by cr, il^e{u) = exp(— |up/2e^)/(27re^) 2 and da; 
is the volume form on E induced by ctQ Define the associated minimax risk 

(2) r,(e)=inf sup 

1.1. Results. The present paper takes a Bayesian approach to the problem of 
determining the asymptotically minimax estimator g. In Bayesian statistics, the 
point is viewed as a random variable with a prior distribution X{9)d0 where 
JggQ A(0) d0 = 1 {d9 = dvg is the riemannian volume of (0,g) ). The Bayesian 
risk of a map g is 

(3) R,{g;\)^ f f dist{g{x),j{9))^X{9)Mx~^{0))dxdd. 

A Bayes estimator 5 : E ^ A is a map which minimizes the Bayesian risk over 
all maps. In Theorem 14. 1[ an expansion in e of the Bayes estimator g^, for a fixed 
Bayesian prior, is computed. The constant term in is 7 o tt, where tt : NO — > Q 
is the projection map of the normal bundle of C E. The order-e^ term in g^ is 
composed of two parts: the first part is independent of the Bayesian prior and its 
contribution is to reduce the energy of 7; the second part is due to the gradient of 
the prior A and it tries to move the estimator in the direction which maximizes A. 
Theorem 14. II also computes the Bayesian risk Re{ge', A) of g^ up to O(e^). 

The results of Theorem 14.11 are used to obtain "the" optimal Bayesian prior. 
There arises a number of interesting problems of a statistical nature in this regard: 
foremost is the problem of deciding what should be the flat Bayesian prior. Given 
a flat Bayesian prior, it is shown that the 2nd-order optimal Bayesian prior satisfies 
an eigenvalue problem. This leads to a second difficulty: in general, the leading 
term in the Bayesian risk is determined by |d7p and is therefore largely independent 



By convention, the covariance operator is the induced inner product on the dual vector space 
E*. If we regard as a hnear isomorphism of E — > E*, then the covariance operator is the inverse 
linear isomorphism c = cr^^ :E* — >E. 

^One can introduce a cr-orthonormal coordinate system Xi on E. In this case, |x'p = xf 
and dx = dx'i A ■ • • A dxs . 
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of the Bayesian prior. Thus, Theorems 15 . Iff5?4l give several criteria for second-order 
optimal Bayesian priors. 

1.2. Note to Reader. The present paper resulted from the work of B. Levit [6l 
[2j[71[5]. This work, and early drafts of the present paper, were done largely in local 
coordinates using Taylor series. This proved to be both daunting, difficult and 
unsatisfying because we were forced to assume that A was isometrically embedded 
in some euclidean space and use the ambient distance function. Paradoxically, these 
computations produced estimators which did not take values in A. 

The problem with the answer these computations produced was obvious, the 
reason for the problem was less so. The ultimate reason is that the Taylor series 
expansion of a function is not a tensorial, or intrinsic, object. Rather, a Taylor 
series depends on the geometry of the domain and range of the function: it is, in 
other words, a geometric object. It is easy to see why this is: a Taylor series requires 
the notion of a second derivative to be defined, but it is well-known that a second 
derivative can be defined only with the aid of an affine connection-a geometric 
object. Calculations with Taylor series in local coordinates masked this fact and 
completely mislead us. 

This is a roundabout way of explaining the extensive geometric formalism used 
in the present paper. We hope that the reader will remember that behind this 
formalism is a simple aim: to define a Taylor series in a rigorous and useful way. 
As a by-product, the answers that result can be stated in a much more compact 
way. 

This paper proceeds as follows: in section [2l a theory of Taylor-Maclaurin series 
is developed for riemannian manifolds and several useful curvature and integration- 
by-parts formulas are developed that are used in seqsequent sections; section [3] 
discusses the existence and uniqueness of a Bayes estimator; section 2] utilizes the 
theory developed in section [5] to expand the Bayesian risk functional and deter- 
mined the Bayes estimator up to O(e^); section [5] develops criteria for second-order 
optimal Bayesian priors in terms of the sub-laplacian of a naturally constructed 
sub-riemannian structure; section [6] computes the examples where 7 is a riemann- 
ian immersion or submersion, which includes the cases where 7 the identity map of 
Q and the inclusion map t of 8 C E. 

Throughout, it is assumed that 8, A are a compact, connected, boundary less 
smooth manifolds. 

2. Maclaurin Series 

This section develops a theory of Maclaurin series of a map between riemannian 
manifolds, then it exposes some useful formulas from riemannian geometry that are 
used in subsequent sections. First, it is useful recall some constructions. 

2.1. Induced metrics. Let X and Y be real inner-product spaces. The vector 
space of linear maps X ^ Y is denoted by Hom(X; Y). Define the inner product 
of linear maps A,B^ Hom(X; Y) by 

{A, B) := ^(A.e„ S.e,} = Tr(A'B), 

i 

where Ci is an orthonormal base of X. The Hilbert-Schmidt norm of a linear map 
is defined in the natural way from this inner product. By construction, if a; G X, 
then \A.x\ < \A\\x\. 



4 



BUTLER, LEVIT 



We can make X® (the n-fold tensor product of X with itself) into a real inner- 
product space by defining 

{ai® ■ ■ ■ ® anM® ■ ■ ■ ®K) = {ai,bi) ■ ■ ■ (a„, 6„}, 

for ai ,bi G X and extending by bi-linearity. The previous construction makes 
Hom(X® , Y) into a real inner-product space. We will use these constructions 
henceforth without further comment. 

2.2. Maclaurin series. Let {M,g) and {N,h) be riemannian manifolds without 

boundary and let M be a smooth map. For x ^ AI and y ~ (t>{x), let T^M 

(resp. TyN) be the tangent space to M (resp. N) at x (resp. y). The exponential 
map exp^ (resp. exp^^) of g (resp. h) is injective on a disk of radius a — a{x) in 
TxM (resp. s — s{y) in TyN), while the tangent map of (j) at x, d^cfi, maps a disk 
of radius t into a disk of radius t x \da:4>\. If r = r(x) is defined to be the minimum 
of s{y)/\dx4>\ and a{x), then there is a commutative diagran|| 

(T^M^g.) ^ ^iT^N,hy) 



<^^Py i.e. (f = exp„ ^ o(j) o exp^ 



(M,5) ^ ^{N, h), 

where T^M (resp. TyN) is the disk radius r (resp. s) in T^M (resp. TyN) centred 
at 0. The map if is & smooth map between open subsets of euclidean spaces that 
maps to 0. Its Maclaurin series expansion is well-defined and can be written as 

(4) (p{v) = d(p{v) + ivd(^(f , v) + ^V^d^iv, V, v) + 0{\v\^), 

for all V € T^M. The hessian Vd(p may be understood as the ordinary second 
derivative of a map between vector spaces, as can V^d(p. However, Lemma 12.11 is 
essential and relates the derivatives of dip to the covariant derivatives of d(p [3] . 

Lemma 2.1. Let v e T^M. Then V^dip{v, . . . ,v)\^ = V^d(j){v, . . . ,v)\^ for all 
k > and all x £ M . 

Proof. Since exp^, oip = (f)oexp^ on the open set T^M, it follows that V'''d(expj^ °'^) lo 
V'^d((/) o exp^)|^. It suffices to show that the lefthand side equals V'^dc/jlp and the 
righthand side equals V'^d^]^ when each are evaluated at {v,. . . ,v). The chain 
rule, along with do exp^, — idT„Af, shows that 

V^-d(expyO^)|^= VM^Ip-HT, 

where T is a sum of terms which are composition of forms V'd(/3, V™dexpj^ with 
l,m < k and m > 1. It therefore suffices to show that 
Claim. V™dexpj,|g = for all m > 1. 

Let V e TyN = To{TyN), let c{t) — expy{tv) be the unique geodesic passing through 
f, and let m„{t) = tv be the multiplication-by-u map. Since c{t) = exp^omi,(t), 
V = dm„(9() and Vdmi, — 0, we have that V c(t)c{t) — Vdc(9t, dt) = Vdexp2,(w, v). 
Thus Vdexp^(i;, ti)|o = 0. 

In the general case, form > 2, V"'dexpj^(w, . . . ,v)\^ — ^ c(t) (' ■ ' (Vc(t)c(i)) ' ' ■ 
Since the innermost term vanishes identically in t, the whole expression vanishes 



t=o' 



^The functions a and s may be assumed to be smooth. 
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The proof for (j) o exp^. is similar. □ 
Lemma 2.2. For all v G T^M , 

(5) o cxp Jw) = exp^(^) (^d(/.(t;) + ivd(/)(v,w) + iv'd0(w,t;,t;)+O(r4)^ . 

Remarks. (1) In general, the exponential map of {N, h) is not a global diffeomor- 
phism. Consequently, ip may not be globally well-defined and its Maclaurin series 
([5]) need not converge globally. The conjugate points of exp^, are obstructions to 
global convergence. If {N, h) is a simply-connected, non-positively curved manifold, 
then expj, is a global diffeomorphism and (p is globally defined. (2) If w G TxM 
is a gaussian with covariance operator e^gx, then, since ip is defined on an open 
neighbourhood of and e is small, its expected value is essentially well-defined and 
equals, by Lemma [23] and equation (|4]), 

ie2r(0).-|-O(64) 

where r((/)) is the trace of the hessian Vd0. Riemannian geometers call r((/)) the 
tension field of (j). The tension field has an interesting interpretation: if one views 

(6) / jd^l'^di^g (di^g = riemannian volume form of g) 

J M 

as the energy of the map (/>, then (p i— > —T{(j)) is the gradient of this functional. It is 
known that r is a semilinear elliptic differential operator that is analogous to the 
laplacian [3] . If one inspects the formula for the bayesian estimator g^ in Theorem 
14.11 one observes that-neglecting the contribution of the prior A-the contribution 
of the 5t((/)) is to move the estimate 'j{9) in the direction which reduces the energy 
quickest. Indeed, if one includes both contributions, then their combination can 
also be viewed in this fashion, but the energy functional depends not on g and h 
but u ■ g and h where the conformal factor it is a fractional power of A. 

2.3. The Ricci Tensor. This section provides the key inputs to the proofs of 
Lemma 14.71 and Proposition 14.81 by proving Lemmas 12.41 12.51 and the integration- 
by-parts formula in Lemma 12.81 To do this, one must make an excursion into the 
riemannian geometry of some naturally occurring vector bundles. In this section, 
(M, g) and {N, h) are riemannian manifolds, possibly with boundary, and (j) : M ^ 
N is a smooth map. 

Let Hom(TM; 4>*TN) be the vector bundle of fibre-hnear maps between TM 
and and (l)*TN; a fibre Hom(TM; (/)*TA^)^, a; G M of Hom(rM; 0*T7V) is the 
vector space of linear maps from T^M to T^^x}N. That is, IIom(rM; (j)*TN)^ = 
'Rom{TxM]T^^^)N). One can view d(/) as a smooth section of Hom(rAf; 0*riV). 
There is a natural metric connection on Hom(TM; (f)*TN), which is denoted by V or 
yHom(TAf;</,*TAr)^ that is induced by the (Levi-Civita) connections on TM and TN 
respectively. Consequently, Vd0 is a smooth section of T*M Hom(TM; (jfTN). 
This latter vector bundle admits a natural metric connection, in turn, and VVd0 — 
V^d^ is then a smooth section of T*M(g)T*M«)Hom(TM; (j)*TN). In other words, 
V^d(/) is a 2-form with values in the vector bundle IIom(TM; 4>*TN). This 2-form 
has a unique decomposition into a symmetric and anti-symmetric part, viz. 

V2d0(x, y) = i {Vly + Vl^) d0 + i {Vly - Vl^) d0. 
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where x, y are vector fields on AI and yd(f> = VxC^ydcj)) — Vv^yd(/). Twice the 
anti-symmetric part of V^d0 is the curvature tensor of (Hom(rA/; (p*TN), V) and 
is written a^ 

(7) R.,yd<l> = i^ly - V2 d</>. 

There is a naturaUy-defincd Ricci tensor associated to the curvature R. Let ej be 
an orthonormal frame. Then for any tangent vector x 

(8) Ricd(i,{x) = ^ Rx.cjdcf) ■ ej 

j 

which is easily seen to be independent of the choice of orthonormal frame. The 
Ricci tensor Ricd^ is a section of Hom(rM; (j)*TN), like d4>. 

The metric on Hom(TM, (j)*TN) and associated bundles is described in section 

Lemma 2.3. Let v S TpM be a gaussian with covariance operator (=metric) gp 
and expected value 0. The expected value of 

(1) V ^ |d(/i(u)p equals |d0p; 

(2) iVd^^w,^)^ equals |r(0)|2 + 2| Vd(/)|2; 

(3) w (d0(w), V^d0(u, w, w)) equals 

J2mei),^%,e,d^ ■ + (V2^,e, + ^l,e.) del) • e,>, 

where is any orthonormal basis of TpM. 

It is recalled that the tension field r((/i) equals Vd(/)(ei, e^) and is a section of 
(j)*TN with its induced norm. The norm of the second fundamental form Vd0 is 
the norm of a section of T*M® Hom(rM; (/)*rAf), so |Vd0|2 = E^.j | Vd(/)(e„ ej)^. 

Proof. A simple calculation. □ 

Lemma 2.4. 

Y,{'d^{e,),V%^,A^-e,+ (vl^,^+V%^,}j d(j,-ej)p - (d,^, 3Vt(0) - 2RiCd0)p. (*) 

Proof. Let be an orthonormal frame at p and let ] denote the left-hand side of 
(*). The subscript p is dropped in the following. A computation yields 

Ve,,e,d0 • ej = V^^. g^d0 • e, Vi, j. 

If ^(d(/)(ei), Vg^ g.d0 • Cj) is added and subtracted to f , then one obtains 

t = 3W(e.), %,e,d4> ■ e,;) + (d(^(e,), i?e..e,d0 • e,), 
hi 

which simplifies to 

t = (d</>,3Trace(V2d0) + Ricd^), 
where Trace(V2d0) = V^^. e^d(/). The identities -Ad0 = Trace(VM0) + Ricd^ 
and — Ad0 = Vt{(P) yield the lemma [3]. □ 

nice concise introduction to the subject of this paragraph is the monograph by Eells and 
Lemaire [3]. Their curvature tensor is minus that presented here, however. Their Ricci tensor is 
the same as that here. 
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The scalar (di/), Ricd^) is simplified in the following lemma. Let Ric*^ be the Ricci 
tensor of (M, g), viewed as a section of Hom(rAjf, TM) and let R^ be the Riemann 
curvature tensor of (N,h). Let be an orthonormal frame on TpM, Ui — d(/)(ej). 
A calculation shows that |3j 

(9) (d</.,Ricd0) = -(d(/.,d0(Ric^O) +^(u„Ri^^„^.u,). 

id 

Since the second term is tensorial in u^, this proves that 

Lemma 2.5. If d<j)\TpM ^d(t>\TpM, then (d0,RiCd0) = (d0, Ric^^) at p. 

Lemma 2.6. tt is harmonic: t{tt) = 0. 

Proof. Since tt o t = ide, and the second fundamental form of t is a quadratic 
form with values in NQ, it follows that Vd7r(d6,d;,) = 0, i.e. Vdvr | T8 vanishes. 
On the other hand, if 9 ^ Q and rj € NgQ, then tt{6 + srj) = tt{9) for all s. 
Therefore VdTr | A^9 vanishes. These two facts show that the trace of VdTr, i.e. 
T(7r), vanishes. □ 

2.4. Integration by Parts. This section recalls the integration-by-parts formula 
following the discussion in j3j . Let : V —>■ M he a vector bundle over the riemann- 
ian m-manifold (Af, g) and let be the space of smooth sections of A^M ^ V, 
i.e. is the space of smooth p-forms on M with values in V. Assume that V is 
equipped with a metric and a compatible connection. There is a natural metric con- 
nection on A^M^V, call it V, which induces an exterior derivation d : A^ — » A^^^ 
by skew-symmetrization. Let d* : A^ A?~^ be the adjoint of d defined by 

(da, p) dvg = / (ct, d*p) dvg + (cr A *p) dvgiQM 

M JM JdM 

for all (T e peA^. Here * : A^ ^ A^^p is the Hodge star operator. 

Lemma 2.7. Let A : A/ — > R 6e a smooth function. Then 



\{da,p)dvg= I {a,d*{\p))dvg+ I \{a ^*p) dvg\gM- 

M JM JdM 

In particular, if X\dM — 0, then 

\{da,p)di'g= / {a,d*{Xp))dvg. 
Jm 

Proof. This follows from applying the definition of d* with p' ~ Xp. □ 

Lemma 2.8. Let d0 G A^ be a 1-form with values in V — (f)*TN and assume that 
X\dM = 0. Then 

d*{Xd<t>) ^ -At(0) -d0(VA). 

Proof. Let be an orthonormal frame on M. If p G A^ and p\dM — 0, then 
d*P = - Ve,p • ei i. Thus, d*[Xdc^) = - AVe.d0 • e, - E. d0(eO • Ve. A. 
The first term equals Xt{(P), while the second term equals d0(VA). □ 
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3. Bayesian Estimators 

In Bayesian statistics, the point 6 is viewed as a random variable with a prior 
density X{e)dO where /g^g, \{e) d6' = 1 {dd = dv^ is the riemannian volume of 
(0,g) ). Recall that the Bayesian risk of a map g is defined to be 

i?,(5;A)= / dist{g{x),-i{e)f\{e)A{x^i{e))dxde. 

JxeE.eee 

A Bayes estimator g : E ^ A is a map which minimizes the Bayesian risk over all 
maps. 

Existence and Uniqueness of the Bayes Estimator. Let us sketch a proof of 
the existence and uniqueness of the Bayes estimator. Define 

(10) h,{x;g,X)= [ dist{g{x),j{9)f \{e)Mx ^ L{0))de. 

One sees that Rg{g;X) — J^^^he{x; g, X) dx. It is clear that a Bayes estimator g^ 
with prior A, if it exists, will have the property that h^{x; g, A) > he(x] g^, A) for all 
X and all estimators g. One may assume that the class of estimators is the set of 
maps between (E, dx) and A. 

By compactness of Q, there is an > and eo > 0, such that for all x € 
E, G 8, and e < eo, the measure ip^ix — L{9))d0 is supported, up to a remainder 
of 0(exp(— 1/e), on the ball of radius about 6 = Tr{x). This observation is trivial 
if X lies within a distance r of O; and it is trivial if x lies in the complement of this 
neighbourhood, since then the measure itself is 0(exp(— 1/e^)). 

Let Br^{0) be the closed ball of radius centred at 6. Possibly after shrinking 
To, the continuity of 7 and compactness of O imply that the image, j(Br^{9)), of 
Br^{9) may be assumed to lie in a closed ball Ds^{'y{9)) of fixed radius So about 

Introduce normal coordinates at 9 and 7(6*) so the above described balls are iso- 
metric to a ball about in a real vector space with an almost euclidean riemannian 
metric of the form ^ ■ dxi dxi + 0(|xp). 

We have therefore shown that he{x]g,X) may be computed, up to a uniform 
remainder term of 0(exp(— 1/e)), using a map between two vector spaces that are 
equipped with metrics that are euclidean up to second order. The techniques used 
in [71 [5] can be used in this situation to show that the Bayes estimator can be 
expanded as a formal power series in e^ and that this estimator is smooth. 

Remark. If (A, h) is a euclidean vector space, the Bayes estimator exists and has 
the explicit form 

This estimator has some rather curious properties: if 7 = t is the inclusion map 
O C E, then g^ is the weighted average of b{9). Because this weighted average need 
not lie on O, one finds that the Bayes estimator is somewhat unsatisfactory. The 
ultimate reason for this is the poor choice of risk functional. 
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4. An Expansion of the Bayesian Risk 

First, introduce a change of variables. 

Lemma 4.1. The map JgiOxE— i>9xE defined by 9 — 9,x = l{9) + ez is a 
diffeomorphism such that 



(12) R,{g;X)= dist(g(t(0) + ez), 7(^))^ A(^^) ^1(2) dz d^. 

Proof. A straightforward calculation. □ 

The expression l(9) + ez equals exp^^g-)(ez) where exp is the exponential map of 
the euclidean space E. The Maclaurin series (equation [5]) implies that 
(13) 

5°exp,(9)(ez) = expg(,(e)) ^ed.g(z) + ^e^Vdg{z, z) + ^e^V'^Ag{z, z, z) + 0{e%f)^ . 

Since A is connected, for each a, 6 G A there is a geodesic c : [0, 1] ^ A such that 
c(0) — a, c(l) = b and the length of c is the distance between a and b. That is, 
\w\a = dist(a, 6) where w = c(0). The tangent vector w = Wa,b is not unique in 
general, but w is a measurable function that is smooth off the the cut locus of a. 

For a = g(i{9)) and b = "f{9), let w = w{9) be the vector Wa,b- The vector w{9) 
is characterized by the property that expg^i^^g^^{w{9)) = 7(0) for all 9 and w{9) is 
a shortest vector amongst all such vectors. The Bayesian estimator : E ^ A is 
written as 

(14) g,{x)=expg^^,){e'g2{x)+0{e')). 

By definition, minimizes the Bayesian risk functional g 1— > i?e(g;A) for each e. 
Since the Bayesian risk functional is an even function of e, the Bayesian estimator 
is, too. 

Lemma 4.2. Let the Bayesian risk — Aq + O(e^). Then 

(15) Ao{g;X)= f \w{9)\^ X{9)d9. 

Jeee 

Consequently, the Bayes estimator g^ satisfies 

(16) ffe o Li9) = exp^(,)(e2g2(.(0))) + 0(6^)) V9. 

Proof. The formula for Aq is straightforward. Since A > a.e. by hypothesis, and 
^0 > 0, it follows that Aq = only ii w = a.e., that is, only if g^=Q o t = 7. Since 
g^^Q = go, equation 1141 implies equation 1161 □ 

Lemma [4.21 shows that 



Corollary 4.3. go o l ~ 

Lemma 4.4. d^^ — dgo + £^dg2 + 0{e'^). 

Proof. Let x £ E and v ^ T^E. It suffices to prove 

dx.9e • V = {d^go + e^d^g2 + 0{e'^)) ■ v. (*) 
The left-hand side of (*) is 

d 



dxgc ■ V 

dt 



exp,(,) {^b{t) + 0(e4)) , (**) 
t=o 
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where a{t) ~ go{x + tv) is a curve in A and b{t) — g2{x + tv) is a curve of tangent 
vectors along a{t). The right-hand side of (**) is the Jacobi field J(s) on (A, h) 
with initial conditions J(0) = a(0) and j(0) = 6(0) + O(e^) at the time s = e^. 
Since J(s) — J(0) + sJ(0) + O(s^) and a(0) = d^^^o • 6(0) — dxg^ ■ v we see that 
(**) implies (*). □ 

Lemma 4.5. Let the Bayesian risk of the Bayesian estimator g^ be = e^A2 + 
e'^A^ + O(e^). If Ak = J akX{0)d9, then the integrand is 

(17) a2 = Idffol' 

„ _ / i|T(5o)|' + i|Vdgo|2 + (dgo,Vr(.g„)-|Ricdc,J + 
^ ^ ' " I |52p + 2(dg2,dgo) + (.92, r(5o)) 

Proof. When one expands gc{i{9) + ez) as a Maclaurin series, one obtains 
(19) exp^(9) (^e^ga + edg,(z) + ie2vd5,(z, z) + ^e^ V^dg,{z, z, z) + 0(e*|z|4; 

Since d^e = dgo + ^^'^92 + 0(e'*) by Lemma the Maclaurin series equals 
(20) 

exp^(e) (^edg,{z) + e^ (^32 + ^Vd5„(z, z)^ + (^g^iz) + ^V^dg,{z, z, z)^ + 0{e^\z\'')^ 
The distance between gc{i-{6) + ez) and 7(6*) expands to 

dist = e2|d5o(z)p + j^j^^ ^ ivd.9o(z, z)p + 2(d5o(z), dg2(2) + '^'^^dg^iz, z, z)) 

(21) +.3(.)+e'^(-) + 0(£'kn, 

where the coefficients on the odd powers of e are odd polynomials in z. Lemmas l2.41 - 
12.51 now implies this lemma. □ 



Recall that tt : N<d ^ is the normal bundle of O in E; the tangent bundle of 
NQ is isometric to TeE while dn is the orthogonal projection of T{NQ) onto TQ. 
CoroUarv 14.31 implies that dgo 1^0 = d7, so on Id^oP > |d7p with equality iff 
dgo\NQ = or dgolTeE = d7od7r. By Lemma l475l these considerations show that 

Proposition 4.6. The Bayesian estimator satisfies 

(1) goOL^j; 

(2) dgo = d(7 7r) on TeE. 

Define F = 7 o tt. The next step is to show that Vdgo = VdF on 0. To do so 
requires that a4 (Lemma 14.51) be simplified. 

Lemma 4.7. Under the standing hypothesis that X > on 0, we have 

(f) /ed0A(0) (d5o,dg2) - /ed0A(0) (<?2, r(7) + d7(Vlog A)); 

(2) <?2 o i = t(7) - iT(go) + d7(V log A). 

(3) a4 = i|VdgoP - |r(7) +d7(VlogA)|2 - |(dr,Ricdr); 

Proof. The inner product A {dgo,dg2) on equals (A ■ d(7 o tt), {dg2) o t) which 
equals (A ■ dj , d{g2i')) ■ The integration- by-parts formula fLemma l2.8p for sections 
of T*0®7*TA yields (f). 
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(1) along with equation [T8l yields 

r |g2 0i-T(7)-iT(5o) + d7(VlogA)|2 + 
04= < (r(5o),r(7)+d7(VlogA)}-|T(7)+d7(VlogA)|2+ 
[ i|Vd5„|2 + (d(7o, Vr(.g„) - fRicdgJ; 

It is clear that is minimized by setting g2 to that in (2). 

Finally, Lemma [^31 implies that (d^o, R-iCdgo) = (dr,RiCdr) on 6. A second 
application of the integration-by-parts formula to A {dgo, VT(go)) proves (3). □ 

Proposition 4.8. The Bayesian estimator satisfies 

(1) Vdgo = Vdr on 9; 

(2) T{go) = t(7) on 6; 

(3) 52 o i = ^'^(7) + d7(V log A); and 

(4) 04 = i|Vdrp - |t(7) +d7(VlogA)|2 - f (dr,Ricdr). 

Proof. By (3) of Lemma l4.71 it is clear that 04 is minimized iff jVdgoP is minimized. 
Let a (resp. (3) be the orthogonal projection of TqE onto T8 (resp. NQ). This 
orthogonal decomposition yields the equality 

iVdffop = |Vdgo(«,a)p +2|Vd5o(a,/?)p + | Vd.g„(/3, 

Since \\7dgo{a, a)]"^ = | Vd5o(dt, dt)|^ and Vd(got) — Vd(7o(dt, dt) -I- dgo • Vdt, 
Proposition yields |Vd.go(Q!, q;)^ = |Vd7|2 = |Vd(77r)(a, q;)!^. Part (4) follows 
from this. 

A Maclaurin series argument shows that Proposition l4.6l impHes that Vdgo{a, (3) = 
Vd(77r)(a;, while |Vd(7o(/37 is unconstrained. This is minimized by = 
|Vd(77r)(/3,/3)|. This proves (1). 

The formula T{go) = ^(7) -t- d7 • t{'k) is implied by (1). Since tt is harmonic by 
Lemma [221 this implies (2). Lemma 14771 part (2) implies (3). □ 



Let us summarize the results of this section. 

Theorem 4.1. Let ge{x) = expg^^^^-j (^e^g2{x) + 0{e^)) be the Bayesian estimator 
for the Bayesian risk functional (Equation\3\) with a fixed Bayesian prior A > 0. 
Then 

(1) for all X G NQ, where 9 — 7r{x) and \x — 9\ < r, 



(2) 

^.(5.; A) j d(?A|d7|2 + 



j dOX |i|Vdr|2 - |t(7) -f d7(VlogA)p - ^ (dL, Ricdr) | + ©(e^), 



where F = 77r. 
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Proof. (1) Let x £ E and ja: — 8| < r. By the hypothesis on the radius r, the 
orthogonal projection of x onto O is well-defined; this orthogonal projection is 
denoted by 6* = 7t{x). Write x = t{9) + ez, where by construction, z e NgO. The 
Maclaurin expansion of at i{9) gives 

/I 



(22) ~g^(x) = ffe oexp^(g)(ez) = exp^^^^^^^ ( edg^ ' ^ + y Vd5e(z,z) + 0{r^e') 



Equation (|16p and Proposition 14.81 3 show that o /,(0) = exp^^^^ where = 
X (5t(7) + d7(Vlog A))^-|-0(e'*). On the other hand, since z e iV^O, Proposition 



shows that 

(23) dg, -z^e^ xdg2-z + 0(re^) 
while Proposition l4.8l l shows that 

(24) Vdg,(z, = X Vd.g2(z, z) + ©(r^e^). 

Equations ([22H24|) imply that gt(x) — exp^^gj(z;e) and that — ^ 0{re^). 
Therefore 

(25) ~g,{x) = exp^(g) (^e^ x Q t(7) + d7(VlogA)j^ + Oire^) 

Since the Bayesian estimator is an even function of e, the error is not 0{re^) but 
must be O(re^). 

(2) This is a straightforward application of the preceding work. 

□ 

Remark. Inspection of the proof above shows that the O(re^) term in Equation 
([25|) is x dg2 ■ z. Thus, the proof also shows that dg2\NQQ vanishes. 



5. Optimal Priors 

In this section we are interested in the behaviour of the mimimax risk which can 
be defined as 

r,(e) = infsup (i?e(<?e,0)-e'M7(e)n, 

9i eee 

where the inf is taken over all possible (sequences of) estimators g^. The problem of 
finding the asymptotic behaviour of re(0) can be derived in a relatively straightfor- 
ward manner from the previous results. Essentially the problem reduces to finding 
optimal priors maximizing the first non-trivial term of the Bayes risk. 

Since in the case of smooth functions the minimax risk r^{Q) is typically of order 
e^, we can define the second- order minimax risk as 

(26) r(e) = liminfsup e"^{R,{g„e) - e^\dj(e)\^). 

Sometimes, a more general minimax risk may be of interest. Let p and q > be 
given function defined on 0. Then equation (j26p can be modified as 

r • f Re{ge,e) ~ e^\d^{e)\-' ~ e^d) 
r (9) = hmmfsup j-7-r- . 
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Even more useful is the following equivalent definition of the second order minimax 
risk 



The advantage of the last formula is that, unlike the previous one, it allows consid- 
eration of smooth functions q{6) > 0, q{6) ^ const. It is thus in this form that the 
second-order minimax risk will be considered below. 

Theorem 1331 part 2, gives a formula for the Bayesian risk expanded up to O(e^) 
of the Bayesian estimator g^. One would like to determine the Bayesian prior 
distribution that minimizes the Bayesian risk. There are a couple interesting twists 
that arise at this point. First, the implicit flat prior is a constant multiple of 
the riemannian volume form d6. However, there is no reason to single out the 
riemannian volume form as the flat prior. Rather, one can introduce the flat prior 
dv = a{9) d9 {a> a.e.) and the Bayesian prior 77(6') dv — \{9) d9, where 77 = A/a. 
Let us stress that the change from d9 to dv, and A to 77, does not change the 
foregoing calculations and results. Second, the minimizcrs of the Bayesian risk 
functional are also the minimizers of the functional 



since the second term is independent oi g. One may, therefore, elect to minimize the 
functional to obtain the Bayesian estimator -which is implicitly a function of 
the Bayesian prior Ad^-and proceed to determine the second-order optimal prior 
by minimizing A 1— > Re{ge', A). Finally, inspection of part (2) of Theorem 14.11 shows 
that one needs tools to understand how to simplify the term |t(7) -f d7(V log A)p. 
The requisite tool is known as sub-riemannian geometry. 

5.1. Sub-riemannian geometry. Let us describe a particular construction of a 

sub-riemannian geometry. Let (M , g) (N, h) be a smooth map, and let Dp = 
kerdpcj)-^ ioi p € M. The collection D — Up Dp is a singular distribution on M. It 
is equipped with an inner product s - a sub-riemannian metric - by declaring that 
dp4>\Dp — > imdp</) C TpN is an isometry. That is, s = (j)*h\D. 

One may think of the subriemannian structure (Z?, s) as a singular distribution 
of directions in which one may travel, along with a metric which allows one to 
measure speed (and angles). Subriemannian structures arise in optimal control 
problems quite frequently [51 [9] . 

One may equivalently characterize the sub-riemannian structure (D, s) by a bun- 
dle map fj, : T*M TM such that (i) is self-adjoint; and (ii) the image of /i 
equals D. In the present context, the map /i is characterized by the identity 



for all smooth functions u,v : M Ti. Equivalently, ^ ■ du = d(^'d(/)(Vu). 

An augmented sub-riemannian structure D = {D, s, dv) is a sub-riemannian 
structure {D, s) plus a volume form dv. The augmented sub-riemannian structure 
permits one to define a sub-laplacian Ajj, which is a second-order, self-adjoint 



(27) 



r*iQ) = M{r\3g : R^g,, 9) < e^\dj{9)\' + e\p{9) + rq{9))}. 



(28) 




/i(dM, dv) = s{Vu, Vv) = (d0(VM), d0(Vf;)), 
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differential operatorQ In local coordinates 



where dv = fdx^ A ■ • ■ A da;'". The sub-laplacian is defined invariantly by 

(30) j u-lS.'Qvdi' = j /i(dM, dw) d:/, 

for all smooth functions that vanish on dM . The self-adjointness of ji implies Aj) 
is self-adjoint. 

If a is a positive function, then let the augmented sub-riemannian structure 
{D,s,a- diy) be denoted by a • £). Equation (|30p shows that the sub-laplacian of the 
augmented sub-riemannian structures differ by a differential operator of first order 

(31) Aa-s = As - ^ ■ dloga. 

5.2. Optimal priors, I. The discussion of sub-riemannian geometry allows the 
expansion of the Bayesian risk fTheorem l4.1[) . The term J d6'A|T(7) -|-d7(V log A)p 
expands to 

(32) J de {uj^\T{-f)f + 4:Lj{T{-f),d-f{\7Lj)) + iujAeuj} , 

where X — ui'^ and (B — {E, s, d9) where E — kerd7^ and s = ^*h\E. Define 

(33) K - i|Vdr|2 - ^ (dr,Ricdr) + |t(7)|' + 2(d7, Vt(7)), 

(34) L = 4Ae + k. 

From this discussion, and an application of the integration-by-parts formula to 
/ d0A(T(7), d7(V log A)), the following is clear. 

Theorem 5.1. The Bayesian risk functional at with prior \ — lo'^ equals 

(35) Re{ge;uj^) =^e^ J d0Lj^\dj\^ + e'^ J dO lo ■ Llj + O^e'^), 
while 

(36) R,{g,;Lj^)^e'^ J d9uj-Luj + 0{e^). 



Define a differential operator TC^ on 8 by 

(37) K:=e^i+|d7p. 

The operator Ti.^ is the Schrodinger operator for a unit-mass particle on 8 in a 
potential field V — \d'y\'^ + e^K with kinetic energy T = ■^{ij{p),p) induced by 
the sub-riemannian metric and Planck constant h — 8e^. From Theorem 15.11 it is 
apparent that R,{g,\uj'^) = e'^ Jd9uj ■ Heto + 0{e^). 



^Warning: the sign of Aj) conflicts with the sign in Montgomery's exposition '§', but it accords 
with the sign convention in riemannian geometry. 
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Theorem 5.2. Let he the largest eigenvalue of He with eigenfunction uj = 
which has J d9uj^ — 1 . Then 

(38) R,{g,;Lu^,) = e^ae + Oie'^). 

Let a be the largest eigenvalue of L with eigenfunction u) normalized so that J dd uj'^ — 
1 . Then 

(39) RS.;^^) = e^o^ + 0{e''), 
and 

(40) r(e) = a. 

Remarks. 1/ In Theorem 15. 2[ it is assumed that He (resp. L) does possess a 
largest eigenvalue. Non-compactness of Q may negate this assumption; it may also 
be negated by properties of the singular distribution E. A reformulation of the 
theorem in the event that He (resp. L) has no largest eigenvalue is clear. 2/ When 
are the eigenvalues of He (resp. L) constant? This depends on the accessibility 
property of the singular distribution E. If sections of E generate TQ under repeated 
Lie brackets, then Hormander has shown that He (resp. L) is hypoelliptic. At 
the opposite extreme, the distribution E might be integrable, in which case the 
eigenvalues of He (resp. L) will vary from leaf to leaf. The optimal prior in this 
latter case is a singular function (a distribution, in the functional-analytic sense) 
concentrated on the leaf with the largest eigenvalue. 3/ The operator He is a 
singular perturbation of a multiplication operator, so one generally cannot naively 
expand ae in a power series. However, when |d7| = ao is constant, the naive idea is 
correct. In this case, one sees that ae = ao + e^a where a is the largest eigenvalue 
of L (modulo the remarks in 1/). 4/ Important special cases include 7 being a 
riemannian submersion or immersion. 

5.3. Optimal priors, II. As noted in the beginning of this section, there is no 
natural reason why one should choose d9 as the flat prior. Let us investigate the 
effect of choosing the flat prior to be dv = a^d9. With arj = uj, one computes from 
equations (I30|31|) that 

4 J dOujAeLU 4 j dv {Ty^ldlogap -I- 2?7^(dloga, dr;) -I- ?7A„2.g?7} , 

(41) =A j dOxa^ X {r;2|dlogap -f jyAg??} , 

where jdlogap = /i(d log a, d log a). Define 

(42) K+ Idlogap, 

(43) L^r; = (4Ac + K J 7?. 

From this discussion and the results of the previous section, the following is clear. 
Theorem 5.3. The Bayesian risk of ge with prior Xd6 = rf'dv (dv — a^d9) equals 

(44) Re{ge;X)^e^ J dv if \d^\'' + j dj, tj ■ LaV + 0{e^), 
while 

(45) Reige-, A) = /di/77 • LaV + O(e'). 
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Since the operator La is self-adjoint with respect to the inner product determined 
by d9, one knows that the prior that maximizes i?e(^e;A) occm'S at a solution to 
the eigenvalue problem 

LaT] = aa'^rj. ($) 

Theorem 5.4. Let a be the largest eigenvalue of the eigenvalue problem ($) with 
eigenfunction rj normalized so that j dvrj^ = 1. Then, with A = a^if' , 

(46) i?,(5,;A) =e4^ + 0(e6), 
and 

(47) r*(e) = a. 
■r*(8) is defined in equation (I27p . 

6. Applications 

There are several cases in which the formulas of Theorem 15.21 yield especially 
nice results. 

6.1. Riemannian immersions. Recall that (j) : (M,g) — > {N,h) is a riemannian 
immersion if (f>*h = g. If 7 : (6,g) — > (-^jh) is a riemannian immersion, then 
the riemannian structure and the induced sub-riemannian structure coincide, while 
d7p = diniG is constant. Remark 3/ following Theorem 15.21 shows that 

Corollary 6.1. Let L — 4A-I-K, where A is the laplacian of (0, g) and n is defined 
in equation i3!!J\) . Let a be the largest eigenvalue of L with eigenfunction tu of unit 
L^-norm. Then = dimQ + f^a, IjJ^ = uj and 

Reige;uje) = {dimQ + a) + 0{e^) , 

There are two interesting special cases of this corollary: when 7 = ide and when 
7 = t (the inclusion map of Q into E). By corollary 16. 1[ the sub-laplacian is the 
same in each case. However, the curvatures of the identity map differ substantially 
from those of the inclusion map. One sees that for x in neighbourhood of Q 



(48) g,{x) 



exp^(,) (2e2v log + 0{e^) if 7 = ide, 

7r(x) + £2 (r(t) + 2Vlog \uj\) + 0{e^) if 7 = t. 



The tension field of the inclusion map r(t) is dim Q times the mean curvature vector 
field - in particular, it is normal to 8 - so gc{x) ^ 9 in the second case. It should 
be noted that w is not the same function in each line. The curvature term k, equals 



(49) 




||r(OP if 7 = ide, 
W)? if 7 = ^. 



While the two estimation problems are incomparable, strictly speaking, it is in- 
teresting to observe that n - and consequently, the dominant eigenvalue of L and 
bayesian risk - is least for the estimator of the inclusion map. This comes with an 
expense: the estimator of u does not take values that are on 8, while the estimator 
of the identity is forced to do so. 

B. Levit, in his unpublished Habilitation thesis, computes k in the case where 
7 = 6. His calculations are carried out in a system of local coordinates, which 
masks the difference between the inclusion and the identity map. The present 
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paper's formalism, based on the Maclaurin series, clarifies these differences and 
explains why the earlier estimator takes values off the manifold Q. The formulas 
in equation 25] sjce proven in the next section. 

6.1.1. Calculations for riemannian immersions. Let {M,g) — ^ {N,h) be a rie- 
mannian immersion. For the present calculations, it may be assumed that TV is a 
riemannian vector bundle over M and (j) is the inclusion of the zero section. The 
projection map N M is denoted by tt. N is naturally identified with the normal 
bundle of M. The tangent bundle to N along M is denoted by TmN = TM ® N. 

Lemma 6.2. Let p e M and x + y,u + v e TpM ® Np. Then 

Vd7r(x + y,u + v) = B> + B'^y, 

where B, : TpM Np is defined by B, — Vd0(», • ) and B', : Np —^ TpM is the 
transposed map. 

Proof. For a smooth vector field x on M, let i be a smooth vector field on N 
that equals x at M. The Levi-Civita connection on M (resp. N) is V (resp. V). 
Since is a riemannian immersion, Wxy = d7r(V5y). From this fact it follows that 
VdTT I TpM vanishes. On the other hand, since Troexp^ | Np = p, Vdn \ Np vanishes. 
Finally, if x e TpM and v S Np, then \/dTr{x,v) = —dTT{\7xv) since d7r(w) = 0. 
Therefore, if z G TpM then 

{z,Vd7r{x,v)) = -(z,d7r(Vj,i;)) = -(z, Va;w) = (V^z, i;), 
= (v, Vjz - Vxz) = (w, Vd(f>{x, z)), 

This completes the proof, since Vdvr is bilinear. □ 

Let us compute the riemannian curvature tensor of AI in terms of that of N and 
the curvature of the immersion (j). From the fact that the Levi-Civita connection 
on M is obtained by orthogonally projecting the connection of N, we have that for 
all vector fields x,y,z on M 

Ky^ = i^y^) - (^-^) " ^l-^y]^^ 

= dn (VidTT ■ VyZ + Vs:\/yZ - VydTr ■ V^z - V^jV^z - Vfj^yj; 

= d7r(R^^z) + (B^B,-B;B,)z, 

where we have used the identity VjdTr-VyZ = B^o(l — d7r)-Vj/Z, since VdTT vanishes 
on the horizontal part. Since (1 — dTr) • V^z — Vd0(?/, z), this demonstrates the 
final line. 

From this equation, it follows that 



(50) 



Ric^^(x) = d^ (^Y.^le^^~^ ^ + E (B^^B., - B^^B,) e. 



where x £ TpM and is an orthonormal basis of TpM. Application of equation lSOl 
to equation [5] yields 

(51) (d0,RiCd0) = |Vd(/.|2-|T((/.)|2. 



18 



BUTLER, LEVIT 



The scalar curvature of Af is the trace of the Ricci tensor, which equals J^i j i^ii e 
b'('/')P ~ l^d^p, where we omit the ~. When (f> is the inclusion l of M into E, we 
see that 

(52) scalM = |T(t)I^ - |Vdt|2. 

To compute {d(j), Vr((/))), note that since r(0) is orthogonal to TpM, (d(/>-ej, Vg^r ((/>)) + 
(Veidcj) ■ ei,T{(j))) vanishes for all i. Therefore 

(53) (d<^,VT(<^)) = -|r(<^)|2. 



Proposition 6.3. [Following the notation of section 4-] Let j be a riemannian 
immersion. Then Ag is the laplacian o/(0,g), and 

(54) ^ = ^ |Vd.p - ^|r(OP - I |Vd7p - i|r(7)r 

If J is totally geodesic (iffS/dj = ifjj{&) is totally geodesic), then 

(55) ^^^|vd.p-^|r(Op. 

Proof From equation [53] and Theorem O k ^ ^\dT\'^ - |(dr,Ricdr) -\t{j)\'^. It 
remains to compute the first two terms. 

(1) Since F = 7 o tt, one sees that VdF = Vd7(d7r, d7r) + d7 • VdTr, which is an 
orthogonal decomposition. Since 7 is a riemannian immersion, 

|VdF|2 = |Vd7p + 2|Vdi|2. 

(2) From equation[9l and the fact that Ric"^ vanishes, (dF, Ricdr) equals j {^i, Ri 
where — d7(ei). Equation [5] and the hypothesis that 7 is a riemannian 
immersion implies that this equals (d7,RiCd7) + scale . Equations 1511 and 
[51] imply that 

(dF, Ricdr) = |Vd7p - |t(7)|2 - |Vd/,|2 + |r(/,)p. 
The two equations prove the formula for k. □ 
Corollary 6.4. The formulas in equation \49\ are correct. 

Proof. 1/ If 7 = ide, then Vd7 = so t{j) — 0, also. 2/ When 7 = t is the 
inclusion map, the equation is clearly correct. □ 

6.2. Riemannian submersions. Recall that M,g N,h is a. riemannian sub- 
mersion iid4>\D : {D,g\D) — * {TN,h) is an isometry where D = (kerd^)"*". In 
this case \d(j)\'^ = dimiV. Equation [§] implies that 

(56) (dcj), Ricd^) = scalo + scaW ° 4>, 

where scal^i is defined to be the trace of Ric^^jZ?. Since \d(j)\ is constant, the identity 
iA|d(^|2 = (d(/),Ricd0) - Vr(0)) - |Vd(?!)p implies that 

(57) |Vd0|2 = scalAT o - scab - (d^, Vr((?!))). 
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Proposition 6.5. [Following the notation of section 4-] Let j be a riemannian 
submersion. Then 

(58) = scalA o 7 + |r(7)p + ^(d7, Vt(7)). 

D 2 

If J is harmonic (iff t{'j) — 0) then 

(59) K = — -scalA0 7. 

Let Ci be an orthonormal frame of Eg. The sublaplacian Ag, when applied to a 
smooth function f, equals 

dimE 

(60) Ae/= ate. 

1=1 

Proof. Equation l561 when applied to the submersion F = 707r, yields (dF, Ricdr) — 
sceIa oF since E is flat. In addition, since |dF| is constant, equation 1571 and lemma 
I2.6l implies that |VdFp = scalAor~(d7, Vr(7)). Equation[33l along with Troi ~ ide, 
implies equation [58l 

For the proof of equation [60l let be a system of normal coordinates centred 
at e and let = ^ + 0{\x\'^) be an orthonormal frame. Assume that Ci — fi 
for i — l,...,dimi?. The bundle map ji : T*Q ^ TQ that characterizes the 
subriemannian structure is written at 6 as 

dime dimB ^ „ 

i i 

Since the riemannian metric g = X]r=i '^^^ ® '^^^ ~^ '^(NP)i where n — dimO, the 
riemannian volume form dd = (1 + 0(|a;|^)) dx^ A • • - Ada;", and so the sub-laplacian 
at 9 is 

dimB „2 

A.= E 



^ (9x^)2' 

which equals X^i^i"^ ^\ & invariant notation. □ 



Remark. If one compares the formulas in Propositions 16.31 and 16.51 one sees that 
both Ag and k depend on the inclusion map i when 7 is a riemannian immersion; 
when 7 is a riemannian submersion Ag and k do not depend on the inclusion map t. 
In the latter case, the geometric information carried by the riemannian submersion 
7 subsumes that carried by l. 
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